rank | frequency | n-gram |
---|---|---|
1 | 5892 | -ી |
2 | 5320 | -ા |
3 | 4425 | -ં |
4 | 3044 | -ે |
5 | 3008 | -ો |
rank | frequency | n-gram |
---|---|---|
1 | 2900 | -ાં |
2 | 2272 | -ની |
3 | 2235 | -ના |
4 | 1804 | -ને |
5 | 1400 | -ું |
rank | frequency | n-gram |
---|---|---|
1 | 2285 | -માં |
2 | 910 | -નું |
3 | 401 | -ાની |
4 | 374 | -નાં |
5 | 316 | -ાના |
rank | frequency | n-gram |
---|---|---|
1 | 460 | -ામાં |
2 | 201 | -ાંથી |
3 | 191 | -ાનું |
4 | 174 | -ોમાં |
5 | 165 | -રમાં |
rank | frequency | n-gram |
---|---|---|
1 | 193 | -માંથી |
2 | 191 | -વામાં |
3 | 107 | -િલ્લો |
4 | 59 | -વાનું |
5 | 43 | -યામાં |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings